Development of Acoustic Model for Croatian Language Using HTK

نویسندگان

Branimir Dropuljić

Davor Petrinović

چکیده

Paper presents development of the acoustic model for Croatian language for automatic speech recognition (ASR). Continuous speech recognition is performed by means of the Hidden Markov Models (HMM) implemented in the HMM Toolkit (HTK). In order to adjust the HTK to the native language a novel algorithm for Croatian language transcription (CLT) has been developed. It is based on phonetic assimilation rules that are applied within uttered words. Phonetic questions for state tying of different triphone models have also been developed. The automated system for training and evaluation of acoustic models has been developed and integrated with the new graphical user interface (GUI). Targeted applications of this ASR system are stress inoculation training (SIT) and virtual reality exposure therapy (VRET). Adaptability of the model to a closed set of speakers is important for such applications and this paper investigates the applicability of the HTK tool for typical scenarios. Robustness of the tool to a new language was tested in matched conditions by a parallel training of an English model that was used as a baseline. Ten native Croatian speakers participated in experiments. Encouraging results were achieved and reported with the developed model for Croatian language.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Croatian Large Vocabulary Automatic Speech Recognition

This paper presents procedures used for development of a Croatian large vocabulary automatic speech recognition system (LVASR). The proposed acoustic model is based on context-dependent triphone hidden Markov models and Croatian phonetic rules. Different acoustic and language models, developed using a large collection of Croatian speech, are discussed and compared. The paper proposes the best f...

متن کامل

The 1998 HTK Broadcast News Transcription System: Development and Results

This paper presents the development of the HTK broadcast news transcription system for the November 1998 Hub4 evaluation. Relative to the previous year’s system The system a number of features were added including vocal tract length normalisation; cluster-based variance normalisation; double the quantity of acoustic training data; interpolated word level language models to combine text sources;...

متن کامل

Dear reader ,

You have at your desk the issue no. 1/2010 of the journal AUTOMATIKA with which begins my direct responsibility for its future development in the capacity of the Editor-in-chief. I would like to thank the KOREMA presidency for entrusting me with this responsibility. Special thanks to my predecessor Prof. Borivoj Rajković for the enormous effort he invested in regular publishing of the journal a...

متن کامل

Lecture 8: Speech Recognition Using Finite State Transducers

In order to use HTK-trained speech recognition models with the AT&T speech recognition search engine, three types of conversion are necessary. First, you must convert the HTK-format hidden Markov models into ATT format acoustic models. Second, you’ll need to write finite state transducers for the language model, dictionary, and context dependency transducer. Third, acoustic feature files need t...

متن کامل

Hindi Speech Recognition System Using Htk

Speech recognition is the process of converting an acoustic waveform into the text similar to the information being conveyed by the speaker. In the present era, mainly Hidden Markov Model (HMMs) based speech recognizers are used. This paper aims to build a speech recognition system for Hindi language. Hidden Markov Model Toolkit (HTK) is used to develop the system. It recognizes the isolated wo...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2010

Development of Acoustic Model for Croatian Language Using HTK

نویسندگان

چکیده

منابع مشابه

Croatian Large Vocabulary Automatic Speech Recognition

The 1998 HTK Broadcast News Transcription System: Development and Results

Dear reader ,

Lecture 8: Speech Recognition Using Finite State Transducers

Hindi Speech Recognition System Using Htk

عنوان ژورنال:

اشتراک گذاری